Week 11.2 - Speculative Futures: A Reading Guide

🎯 What We'll Cover

Sub-Lesson 11.1 deliberately stayed inside the present and the very near term — results that are already shipping or that look highly likely to land within the next year, and that you can therefore actually check against primary sources today. This sub-lesson takes the same calibrated reading habit and applies it to the genuinely speculative end of the AI-in-research literature — the work that asks not “what does AI do for science now?” but “what could AI do for science, eventually, and what would that change?” Five years out is unknowable territory; the framework of 11.1 stops well short of it deliberately.

The calibration habit from 11.1 still applies here — only more strongly. Confident sentences about 2030 are rarely better-grounded than confident sentences about 2026. Several of the works in this guide are serious academic frameworks; a couple are essentially structured science fiction with a confident tone. The lesson is organised so you can tell them apart.

The goal is not to make you a forecaster. It is to give you a working reading guide for the question “what kind of researcher do I want to be in a world where this stuff keeps moving?” — which is the question that quietly underlies everything else in this week.

🧬 A. Frameworks for Thinking About AI in Science

If you only read one piece of speculative-futures work, make it one of these three. They are the conceptual scaffolding most subsequent debates are arguing inside.

The three dimensions of AI in understanding (Krenn et al., 2022)

Krenn and colleagues, writing in Nature Reviews Physics, set out a three-part framework for how AI can contribute to scientific understanding: as a computational microscope (observing things humans cannot), as a resource of inspiration (a muse for hypotheses), and as an agent of understanding — an AI that itself understands what it has found.

The paper itself classifies the third dimension as “the ultimate, not yet existent” capability. Read it for the calibrated framing as much as for the framework: this is what a serious speculative paper sounds like.

Krenn, M. et al. (2022). On scientific understanding with artificial intelligence. Nature Reviews Physics 4, 761–769. arXiv:2204.01467.

The AI-in-discovery landscape (Wang et al., 2023)

Wang and colleagues, in Nature, map AI's emerging role across every stage of the scientific process: hypothesis generation, experiment design, data collection, analysis, interpretation. The paper is a landscape, not a manifesto — useful for orienting yourself before reading any of the more polemical speculative work.

Notable for being honest that “AI in science” is not one thing: protein structure prediction, materials discovery, automated chemistry, and LLM-assisted writing all sit at different levels of maturity and need to be evaluated separately.

Wang, H., Fu, T., Du, Y. et al. (2023). Scientific discovery in the age of artificial intelligence. Nature 620, 47–60. DOI 10.1038/s41586-023-06221-2.

Levels of AGI (Morris et al., 2023)

A DeepMind team led by Meredith Ringel Morris — with Shane Legg among the authors — proposes a six-level framework for AGI capability, indexed jointly by performance (depth) and generality (breadth), explicitly analogous to the SAE levels of autonomous driving. The framework deliberately avoids putting a date on any level.

Useful precisely because it refuses to commit to a timeline. When you read confident claims about “AGI by 2027”, this paper is what such claims are pretending to be.

Morris, M. R. et al. (2023). Levels of AGI: Operationalizing Progress on the Path to AGI. arXiv:2311.02462.

📊 B. Concrete, Falsifiable Forecasts

A useful filter for speculative work: does the author commit to a number, on a defined timescale, that someone could check? Most speculative writing about AI does not. The few works that do are worth reading carefully, because they are the parts of the discourse that can actually be wrong in the ordinary scientific sense.

⏱️ The METR “doubling-every-seven-months” study

In March 2025, the AI-evaluation lab METR published a paper introducing a deceptively simple metric: the 50% time horizon, defined as the length of a software task an AI system can complete with at least 50% success. Over the six years from 2019 to 2025, the 50% horizon has doubled approximately every seven months.

The endpoints of the curve in the March 2025 paper are striking: from GPT-2 in 2019 (a 50% horizon of seconds) up to Claude 3.7 Sonnet in early 2025 (around 50 minutes). METR maintains an updated “Time Horizon” dataset with newer models and a larger task suite; for current figures, check METR's own tracker directly rather than the often-inflated numbers that circulate in secondary news coverage — a small live demonstration of this lesson's central habit.

Extrapolated naively, the curve says that within a decade AI systems will be able to autonomously complete the kind of software work that today takes a person days or weeks. The honest caveat the METR team itself flags is that exponential curves bend — nothing in nature stays exponential forever — and they are deliberately not predicting when the bend will happen.

Why read it: this is the cleanest example of a falsifiable forecast in the speculative-futures literature. By 2028 we will know whether the 7-month doubling held; that is more than can be said for most predictions in this area. METR (March 2025). Measuring AI Ability to Complete Long Software Tasks. arXiv:2503.14499.

⚠️ What an extrapolation can't tell you

A doubling time is not the same thing as a model of why the doubling happens. The METR paper documents the curve; it does not explain it. The curve could continue. It could also bend tomorrow if, say, the reliability ceiling on long-horizon agents (Week 10.2) turns out to be the real bottleneck. Treat extrapolation curves the way you would treat any time-series forecast in your own field: useful as a starting estimate, dangerous as a basis for confident predictions about specific dates.

Adjacent to METR is the body of work from Epoch AI — an organisation focused on training-compute scaling, dataset-size trends, and the economics of frontier model training. Their forecasts share METR's strength (quantitative and refutable) and weakness (extrapolation without a mechanism). Worth reading if you want the technical version of “how big might AI training get?”

🏘️ C. Institutional Visioning

This category contains some of the most quietly important speculative-futures work, because it is what the bodies that set rules — learned societies, national academies, intergovernmental panels — think research is going to look like. These documents move slowly and read carefully, but they are the speculative pieces that translate most directly into actual policy.

📚 Science in the age of AI (Royal Society, May 2024)

The Royal Society's 2024 working-group report (ISBN 978-1-78252-712-1) draws on interviews with more than 100 scientists across disciplines plus a working group of experts. It frames AI as transforming the nature, method, and integrity of scientific research, and is unusually careful in its scepticism: a central recommendation is that overdependence on “opaque” AI systems could undermine the reliability of scientific findings and the public's trust in them.

Read it for its tone. This is what speculative-futures discourse sounds like when written by a body that takes responsibility for science as an institution rather than as a market.

royalsociety.org/.../science-in-the-age-of-ai

In a similar register is the Africa Declaration on AI (April 2025), signed at the conclusion of the Global AI Summit on Africa by almost every AU member state. Its institutional speculation is the proposal for an African AI Scientific Panel — a regional body of researchers from Africa and the diaspora to provide evidence-based research on AI risks and opportunities. Whether the panel materialises in the form proposed will be a useful early test of how much of the continental-strategy rhetoric (covered in 11.5) translates into operational reality.

🚀 D. Wild Speculation

This bucket contains the most ambitious and least-grounded work in the field. The works listed below range from serious academic research that is openly speculative to documents that are essentially structured science fiction with a confident tone. They are listed not because we endorse them but because they are influential enough that you will encounter them, and being able to read them critically is part of the disposition this week is asking you to build.

Open-endedness and AI-Generating Algorithms (Clune, 2019)

Jeff Clune — one of the co-authors of the Sakana AI Scientist paper covered in 11.1 — proposes that the most plausible route to general-purpose AI is to build systems that invent better AI systems, in an open-ended evolutionary process. The 2019 paper (arXiv:1905.10985) is the foundational reference for the lineage of automated-science work, of which the Sakana AI Scientist is a recent product.

This is serious academic research that is honest about its speculative character. Useful to read for the intellectual history alone.

Clune, J. (2019). AI-GAs: AI-generating algorithms, an alternate paradigm for producing general artificial intelligence. arXiv:1905.10985.

AI 2027 scenario (Kokotajlo et al., April 2025)

A 71-page month-by-month scenario, written by Daniel Kokotajlo's AI Futures Project, of how the period 2025–2027 might play out in capabilities and geopolitics. The scenario predicts “superhuman coders” by March 2027 and “superhuman AI researchers” by June 2027.

Two honest notes. The authors themselves estimate roughly 50% probability that the superhuman-coder milestone is missed on the 2027 timeline, and Kokotajlo's own median forecast has since shifted to “around 2030, lots of uncertainty though”. Critics describe parts of the scenario as closer to science fiction than forecasting. Worth reading as the cleanest specimen of confident-tone speculation in the field — the kind of document the reading habit from 11.1 is most useful against.

AI Futures Project, AI 2027, April 2025.

Human Compatible and the alignment literature (Russell, 2019)

Stuart Russell's book lays out the speculative-but-serious case that controlling capable AI systems — getting them to do what humans actually want, rather than what we literally asked — is a research problem in its own right. The book has become foundational reading for the AI safety / alignment literature, which now has its own academic conferences and a growing presence in policy discussions.

Worth reading because the alignment conversation is one of the few places where speculative AI futures translate directly into institutional and regulatory action.

Russell, S. (2019). Human Compatible: Artificial Intelligence and the Problem of Control. Viking.

The counter-position: AI as “normal technology” (Narayanan & Kapoor, 2025)

Almost everything else in this bucket argues, in one register or another, that AI is on a steep trajectory toward transformative or general capability. The most prominent serious work arguing the opposite is Arvind Narayanan and Sayash Kapoor's essay “AI as Normal Technology” — the authors of the AI Snake Oil book and newsletter (whom you met in Week 7). They argue that AI is best understood like electricity or the internet: genuinely transformative over decades, but diffusing through the economy at the ordinary, friction-bound pace of any general-purpose technology — not as an imminent “superintelligence” discontinuity.

It is included here deliberately as the calibration counterweight: a reading guide that only lists escalating-capability speculation is itself miscalibrated. Read it against AI 2027 in particular — same near-future, opposite priors — and notice which of the two gives you more ways to check it later.

Narayanan, A. & Kapoor, S. (2025). AI as Normal Technology. Knight First Amendment Institute, 15 April 2025.

The frontier-vendor speculation lane

Senior figures at the major AI labs publish their own speculative-futures pieces with some regularity: Sam Altman's essays, Dario Amodei's “Machines of Loving Grace” (October 2024), Demis Hassabis's interviews. These are not academic work; they are speculative essays by people with strong commercial interests in particular futures coming true. They are also genuinely influential on policy and public understanding.

Read them as primary sources for “what frontier labs publicly believe (or wish to be seen believing) about the future”, not as predictions.

Various vendor essays, 2024–2026.

🌍 A Note on the African Gap

The serious speculative-futures literature is, by and large, written from a small number of institutions in the Global North. Of the works in this guide, none are authored from an African research institution; the institutional visioning that comes closest is the Africa Declaration on AI, which is a political document rather than a research one.

This is itself a research opportunity. The closest existing African work in this register sits in the literature we covered in Weeks 4 and 11.5:

Mhlambi (2020), From Rationality to Relationality — argues for ubuntu as a foundation for AI ethics, with implications for what a non-Western AI research culture would look like.
Effoduh (2026), “Decolonizing the governance of artificial intelligence in Africa” (Science and Public Policy 53(2), 245–257) — develops the concept of epistemic sovereignty as a speculative-normative goal for African AI work.
Nyabola (2026), “Foundations for African feminism as an ethics for artificial intelligence” (Science and Public Policy 53(2), 277–288) — makes the case that a genuinely African AI research tradition would need to start from different onto-epistemological commitments, not just import Northern AI tools and add an “African values” layer on top.

If you find the speculative-futures conversation interesting, one of the most useful contributions you could make as an African postgraduate researcher is to add to this side of the literature. The frame the Effoduh and Nyabola papers develop is genuinely under-applied to questions about AI research as opposed to AI governance, and there is room for serious work that explores what an African vision of AI in science would look like, beyond importing the Northern speculative literature wholesale.

💡 The reading habit, restated for speculative work

When you read a speculative-futures piece — in this guide or elsewhere — the questions worth asking are not so different from the ones 11.1 set out. Is there a number the author commits to that someone could check? Is the speculation grounded in a mechanism or in a graph of past trends? Does the author take responsibility for being wrong, or do they reserve the right to retrofit their predictions?

If a piece of futures work would not change anything its author does if it turned out to be wrong, it is closer to fiction than forecasting. That is not a reason to ignore it — serious fiction can be useful — but it is a reason to read it differently from a piece of work the author would defend on the same terms as their other research.

✏️ An Optional Exercise

If you want to put the calibration habit through its paces:

Pick one work from buckets A, B, C, or D above that you have not previously read.
Read it, with particular attention to where in the text the author signals the limits of their own claim. (Look for words like “might”, “could”, “under the assumption that”, and for hedged probability language.)
Locate one specific claim in the work that you could imagine checking in 2030 — a date, a number, a milestone, a behaviour. Write it down in a sentence.
Write a second sentence describing what evidence would persuade you the claim was wrong.

That second sentence — what would change your mind — is the centre of the calibrated reading habit. A speculative work that gives you no way to check it later is a work you should hold loosely. One that does is a work you can engage with on the same terms as any other piece of research.

📚 Full Reference List

📄 Primary sources used in this guide

Krenn, M. et al. (2022). On scientific understanding with artificial intelligence. Nature Reviews Physics 4, 761–769. arXiv:2204.01467 · Nature page.

Wang, H. et al. (2023). Scientific discovery in the age of artificial intelligence. Nature 620, 47–60. DOI 10.1038/s41586-023-06221-2.

Morris, M. R. et al. (2023). Levels of AGI: Operationalizing Progress on the Path to AGI. arXiv:2311.02462.

METR (March 2025). Measuring AI Ability to Complete Long Software Tasks. arXiv:2503.14499. Blog post: metr.org.

Royal Society (May 2024). Science in the age of AI. ISBN 978-1-78252-712-1. Project page.

Africa Declaration on AI (April 2025). Adopted at the Global AI Summit on Africa. (Covered in detail in 11.5.)

Clune, J. (2019). AI-GAs: AI-generating algorithms, an alternate paradigm for producing general artificial intelligence. arXiv:1905.10985.

Kokotajlo, D. et al. (April 2025). AI 2027 scenario. AI Futures Project. (Available via the project's website; treat as the prominent specimen of structured speculative-futures scenario planning, not as forecast.)

Narayanan, A. & Kapoor, S. (2025). AI as Normal Technology. Knight First Amendment Institute, 15 April 2025. knightcolumbia.org.

Russell, S. (2019). Human Compatible: Artificial Intelligence and the Problem of Control. Viking.

Effoduh, J. O. (2026). Decolonizing the governance of artificial intelligence in Africa. Science and Public Policy 53(2), 245–257. DOI 10.1093/scipol/scag005.

Nyabola, N. (2026). Foundations for African feminism as an ethics for artificial intelligence. Science and Public Policy 53(2), 277–288. DOI 10.1093/scipol/scag009.

Mhlambi, S. (2020). From Rationality to Relationality: Ubuntu as an Ethical and Human Rights Framework for Artificial Intelligence Governance. Carr Center for Human Rights Policy, Harvard Kennedy School.

Coming up in 11.3: we turn from speculative futures to the institutional present — what journals, funders, and peer-review systems have actually done in response to AI in research over the last 18 months, and one large recent study showing that, despite ~70% of journals now having AI-disclosure policies, only about 0.1% of post-2023 papers actually disclose AI use. The gap between policy and practice is the central feature of the landscape we will look at next.